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Amendments to the Claims: 

This listing of claims will replace all prior versions and listings of claims in this 
application: 

Listing of Claims: 

1 .(Currently amended) A processor-implemented method of 
clustering a set of records, each of the records having attribute values for a 
set of attributes, the method comprising: 

for each attribute of the set of attributes, determining a characteristic 
value for said each attribute, based on attribute values of said each 
attribute; 

wherein determining the characteristic value comprises calculating 
the attribute values of said attribute across the records; 

for each attribute value, determining a deviation from the 
characteristic value of said each attribute; 

for each record, sorting the set of attributes based on deviations of 
the attribute values, to provide a key; and 

clustering the set of records based on the key. 

2. (Original) The method of claim 1 , further comprising calculating a 
mean value of the attribute values of said each attribute as the 
characteristic value. 
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3. (Original) The method of claim 1 , wherein a median value of the 
attribute values of said each attribute is determined as the characteristic 
value. 

4. (Original) The method of claim 1 , wherein determining the deviation 
comprises calculating a difference between said each attribute value and 
the characteristic value of said each attribute. 

5. (Original) The method of claim 1, wherein determining the deviation 
comprises calculating a difference between said each attribute value and 
the characteristic value of the corresponding attribute, and dividing the 
difference by the characteristic value of said each attribute. 

6. (Original) The method of claim 1 , wherein sorting the set of attributes 
comprises using absolute values of the deviations of the attribute values as a 
sorting criterion. 

7. (Original) The method of claim 1 , wherein a first record of the set of 
records contains a first key and a second record of the set of records 
contains a second key; and 

further comprising placing the first key and the second key into a 
single cluster if the first key and the second key have identical sub- 
sequences of a first length. 
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8. (Original) The method of claim 1 , wherein a first record of the set of 
records contains a first key and a second record of the set of records 
contains a second key; and 

further comprising placing the first key and the second key into a 
single cluster if the first key and the second key have identical sub- 
sequences of absolute values of the deviations. 

9. (Original) The method of claim 1 , wherein a first record of the set of 
records contains a first key that has a first sub-sequence, and a second 
record has a second sub-sequence contains a second key; and 

further comprising placing the first key and the second key into a 
single cluster if the first and second sub-sequences comprise the same set of 
attributes. 

1 0. (Original) The method of claim 9, wherein the first and second sub- 
sequences comprise the same set of attributes irrespective of a sign of the 
deviations of the attribute values. 

1 1 . (Currently amended) The method of claim 10, further comprising 
refining a clustering result by : 

identifying a cluster having a smallest number of records; a^d 
for each record of the identified cluster searching another cluster 

having records with best matching keys ; and 

distributing the cluster with the smallest number of records to the other 

cluster having records with best matching keys, to reduce the totol number 

of clusters . 
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1 2. (Original) The method of claim 1 1 , further comprising reducing a 
length of the first sub-sequence and a length of the second sub-sequence in 
order to find a best match. 

1 3. (Original) The method of claim 1 2, further comprising using a 
distance measure to find another cluster for a record of the identified 
cluster. 

1 4. (Currently amended) The method of claim 1 3, wherein the 
distance measure comprises a iucMs Euclidean distance. 

15. (Currently amended) A computer program product having 
executable instruction codes that are stored on a computer-usable 
medium, for clustering a set of records, each of the records having attribute 
values for a set of attributes, the computer program product comprising: 

a first set of instruction codes, which, for each attribute of the set of 
attributes, determines a characteristic value for said each attribute, based 
on attribute values of said each attribute; 

wherein the characteristic value is calculated from the attribute 
values of said attribute across the records; 

a second set of instruction codes, which, for each attribute value, 
determines a deviation from the characteristic value of said each attribute; 

a third set on instruction codes, which, for each record, sorts the set of 
attributes based on deviations of the attribute values, to provide a key; and 
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a fourth set of instruction codes for clustering the set of records based 
on the key. 

1 6. (Original) The computer program product of claim 1 5, further 
comprising a fifth set of instruction codes for calculating a mean value of 
the attribute values of said each attribute as the characteristic value. 

1 7. (Original) The computer program product of claim 15, further 
comprising a sixth set of instruction codes for setting a median value of the 
attribute values of said each attribute as the characteristic value. 

18. (Original) The computer program product of claim 15, wherein the 
second set of instruction codes determines the deviation by calculating a 
difference between said each attribute value and the characteristic value 
of said each attribute. 

19. (Original) The computer program product of claim 15, wherein the 
second set of instruction codes determines the deviation by calculating a 
difference between said each attribute value and the characteristic value 
of the corresponding attribute, and by dividing the difference by the 
characteristic value of said each attribute. 

20. (Original) The computer program product of claim 15, wherein the 
third set on instruction codes sorts the set of attributes using absolute values 
of the deviations of the attribute values as a sorting criterion. 
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21 . (Currently amended) A processor-implemented system for 
clustering a set of records, each of the records having attribute values for a 
set of attributes, the system comprising: 

each attribute of the set of attributes comprising a characteristic 
value for said each attribute based on attribute values of said each 
attribute; 

wherein the characteristic value is calculated from the attribute 
values of said attribute across the records; 

each attribute value comprising a deviation from the characteristic 
value of said each attribute; 

each record comprising the set of attributes based on deviations of 
the attribute values, to provide a key; and 

wherein the set of records are clustered based on the key. 

22. (Original) The system of claim 21 , wherein a mean value of the 
attribute values of said each attribute is calculated as the characteristic 
value. 

23. (Original) The system of claim 21 , wherein a median value of the 
attribute values of said each attribute is calculated as the characteristic 
value. 

24. (Original) The system of claim 21 , wherein the deviation is 
calculated as a difference between said each attribute value and the 
characteristic value of said each attribute. 
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25. (Original) The system of claim 21 , wherein the deviation is 
determined by calculating a difference between said each attribute value 
and the characteristic value of the corresponding attribute, and by dividing 
the difference by the characteristic value of said each attribute. 

26. (Original) The system of claim 21 , wherein the set of attributes is 
sorted using absolute values of the deviations of the attribute values as a 
sorting criterion. 



9 



